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Abstract 

The Shannon capacity of a graph G is the maximum asymptotic rate at which messages can 
be sent with zero probability of error through a noisy channel with conf usability graph G. This 
extensively studied graph parameter disregards the fact that on atomic scales, Nature behaves 
in line with quantum mechanics. Entanglement, arguably the most counterintuitive feature of 
the theory, turns out to be a useful resource for communication across noisy channels. Recently, 
Leung, Mancinska, Matthews, Ozols and Roy [Comm. Math. Phys. 311, 2012] presented two 
examples of graphs whose Shannon capacity is strictly less than the capacity attainable if the 
sender and receiver have entangled quantum systems. Here we give new, possibly infinite, 
families of graphs for which the entangled capacity exceeds the Shannon capacity. 

1 Introduction 

A sender transmits a message to a receiver. The main problem that information theory addresses 
is that noise could make the sender's announcement ambiguous. To analyze this problem, one 
models a noisy communication channel by an input alphabet S, an output alphabet 1Z and a set of 
conditional probabilities P(a\x) for the probability that the receiver gets the letter a G 1Z when the 
sender transmits the letter x G S. Two input letters x and y can then be confused with one another 
if they can lead to the same signal on the receiver's end of the channel, that is, if there is an output 
letter a such that both the probability P(a\x) of the receiver getting a when the sender sent x, and 
the probability P(a\y) of the receiver getting a when the sender sent y, are nonzero. To cope with 
noise, the communicating parties could agree that the sender uses only input letters that lead to 
distinct signals on the receiver's end, in which case the sender restricts to some set T C S such 
that for every pair of distinct inputs x,y G T and a G TZ, at least one of the probabilities P(a\x) 
and P(a\y) vanishes. In a celebrated paper, Shannon [Sha56] initiated the study of the zero error 
capacity, the maximum rate of error-free communication with sequential uses of a memoryless 
channel. A channel is memoryless if using it does not change its behavior on later messages. 
By encoding messages into words consisting of multiple input symbols that are transmitted in 
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Figure 1: (i): A channel with five inputs (numbers) and five outputs (letters), where an input 
symbol x is connected to an output symbol a if P(a\x) > 0. Notice that no two of the five pairs 
(0,2), (1,4), (2, 1), (3,3), (4,0) can be confused with one another, as either the first or the second 
two symbols are non-conf usable, (ii): the confusability graph of the channel, the five cycle C5. 



sequence, a channel can sometimes be used more efficiently than if non-confusable symbols are 
simply concatenated. Shannon demonstrated this with a famous example of a channel with five 
inputs and outputs. At most two symbols can be sent perfectly with one use of the channel. 
Instead of the expected four, five messages can be sent perfectly with two uses of the channel (see 
Figurejl} (i)). 

Associated with a noisy channel is its confusability graph G = (V, E), with as vertex set V 
the input alphabet of the channel and edge set E consisting of those pairs of inputs that can be 
confused (see Figure [TJ (ii)). The largest number of messages that can be sent through a channel 
with zero probability of error equals <x(G), the maximum cardinality of an independent set in 
the graph G. The graph G^ n has as vertex set V n , the set of all n-tuples of elements from V. 
Two different vertices [u\, . . . ,u n ) and {v\, . . . ,v n ) are adjacent in G^ n if and only if they can be 
confused (i.e., if for every i = 1, . . . , n, either U{ = V\ or u, is adjacent to i?, in G). The independence 
number a(G K ") thus gives the maximum number of pairwise non-confusable messages that can 
be sent using n-letter codewords. The Shannon capacity of the confusability graph, 

0(G) = sup {jot{G® n ) = lim ^ct(G m ), 

gives the zero-error capacity of a channel. 

Inevitably, devices used for information processing are subject to the laws of physics, and on 
atomic scales quantum mechanics is currently the most accurate model of Nature. Thirty years 
before Shannon's paper was published, Einstein, Podolsky and Rosen [EPR35| pointed out an 
anomaly of quantum mechanics that allows spatially separated parties to establish peculiar cor- 
relations: entanglement. Later, Bell [Bel64] proved that local measurements on a pair of spatially 
separated, entangled quantum systems, can give rise to joint probability distributions that violate 
certain inequalities (now called Bell inequalities) satisfied by any distribution that may arise in 
classical mechanics. Experimental results of Aspect et al. [AGR8IJ give strong evidence that Na- 
ture indeed allows distant physical systems to be correlated in such non-classical ways. Motivated 
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by these important discoveries, one defines the zero-error entanglement-assisted capacity of a classi- 
cal channel to be the maximum rate at which messages can be transmitted when the sender and 
receiver share a pair of entangled quantum systems (the precise model is described below). 

Analogous to the classical setting, Cubitt, Leung, Matthews and Winter ICLM WlOl defined 
the graph parameter otq(G) (a quantum variant of the independence number), and proved that 
it equals the maximum number of pairwise non-confusable messages that can be sent with a 
single use of a noisy channel and shared entanglement. They found examples of graphs for 
which otq(G) > tt(G), showing that the use of entanglement can increase the "one-shot" zero- 
error capacity of a channel (more examples where found recently by Mancinska, Scarpa and Sev- 
erini |MSS12|). This result was seen as a surprise in light of the fact that entanglement cannot in- 
crease the transmission rate if one only demands that the probability of confusion goes to zero with 
the number of uses of the channel, as was proved by Bennett, Shor, Smolin and Thapliyal [BSST02 1. 
Of course, the next question was if the entangled capacity, defined by 



©,(G) = lim a 0Cq(G m ), 

could be strictly greater than the Shannon capacity. 

In contrast to the combinatorial nature of the Shannon capacity, the entangled capacity can 
sometimes be lower bounded using geometric constructions. An orthonormal representation of a 
graph is a map that sends each of the vertices of the graph to a vector on a Euclidean unit sphere, 
such that adjacent vertices are sent to orthogonal vectors^ It turns out that if a graph G has a d- 
dimensional orthonormal representation, then a.q(G), and therefore ? (G), is at least the number 
of disjoint d-cliques in G (see Proposition [7b. 

Using the above argument, Leung, Mancinska, Matthews, Ozols and Roy [LMM + 12J recently 
found the first two examples of graphs whose entangled capacity exceeds the Shannon capacity. 
Their graphs are based on the exceptional root systems £7 and Eg, giving a graph Ge 7 on 63 vertices 
such that 0(j(Ge 7 )/®(Ge 7 ) = 9/7andagraph Ge s on 157 vertices such that (? (G£ 8 )/®(Ge 8 ) = 
15/9. They also show that their construction fails on any of the (four) infinite families of root 
systems, in the sense that it yields graphs whose Shannon and entangled capacities are equal. 

The above-described lower-bound technique perhaps makes the orthogonality graph — whose 
vertices are the binary strings of length n, and whose edges are all pairs with Hamming dis- 
tance n/2 (for n even) — the most natural candidate to separate the two capacities. This graph lies 
at the heart of many constructions that show a separation between some classical quantity and its 
quantum analogue, such as in communication complexity [BCW98. BCT99|, in Bell-inequality vi- 
olations [AHKS061 ICMN+071 IGN08I and in the "one-shot" zero-error capacity of a noisy channel 



(where one compares oc(G) to a.q(G)). 

By using the {— 1, 1} -basis for bits, one directly obtains an n-dimensional orthonormal repre- 
sentation for the orthogonality graph, since two strings have Hamming distance n/2 (meaning 
they are adjacent) if and only if they are orthogonal. A Hadamard matrix is a square matrix with 



1 We stress that in our definition orthogonality corresponds to adjacency. Some authors prefer to demand orthogonality 
for non-adjacent vertices instead. 
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entries in {— 1, 1} such that its rows are mutually orthogonal. If a Hadamard matrix of size n ex- 
ists, then its rows thus give a clique of size n in the orthogonality graph. The fact that this graph is 



vertex transitive then implies that it in fact has at least 2"/n 2 disjoint cliques (see Lemma 11 and 
Proposition [16] below), giving the same lower bound on its entangled capacity. It is well known 
that Hadamard matrices exist when n is a power of 2 and the famous Hadamard conjecture states 
that they exist whenever n is a multiple of 4. Although this conjecture remains unproven, it is 
widely believed to be true. 

An indication that the orthogonality graph might exhibit a separation between the Shannon 
and entangled capacities is given by a deep result of Frankl and Rodl [FR87| showing that if n is 
a large enough multiple of 4, then the independence number is less than (2 — e) n for some e > 
independent of n. But despite effort from the quantum-information community, it remains un- 
known if this graph gives such a separation. Our main result shows that under certain conditions 
such a result does hold for a "quarter" of the orthogonality graph. 

2 Our results 

In this paper we present two new, possibly infinite, families of basic graphs whose entangled 
capacity exceeds the Shannon capacity. The graphs are defined as follows. Let H n be the graph 
with as vertex set all binary strings of odd length n and even Hamming weight, and as edge set 
the pairs with Hamming distance (n + 1) / 2. Let G n be the subgraph of H n induced by the strings 
of Hamming weight (n + l)/2. We prove the following. 

Theorem 1 Let p be an odd prime such that there exists a Hadamard matrix of size 4p. Then, for n = 
4p — 1 and G either G n or H„, we have 

0(G) - V p5/2 

Notice that the graph H n is a subgraph of the orthogonality graph on {0, 1}" +1 , induced by 
the (n + 1 ) -bit strings with even Hamming weight and first coordinate equal to 0. Based on con- 
structions of Hadamard matrices due to Scarpis [Sca98j and Paley HPal33| , Theorem [I] holds for 
any prime p such that 4p — 1 = q for some odd prime q and positive integer k. The first three 
examples of such (p,q) pairs for k = 1 are (3,11), (5,19) and (11,43). Examples for p m 10 12 and 
k = 1 can readily be generated with little computing power. The subset of strings in the vertex set 
of G n that have zeroes on the last (n — 7)/4 coordinates is an independent set of size Q(2°- 29n ), 
showing that the Shannon capacity of G n is exponential in n. Since G n is an induced subgraph of 
H n , the same holds for the latter graph. 

We observe that the results of ||LMM + 12B imply that the sequence of graphs (Gf") nG N has a 
capacity ratio 0^/0 that grows as roughly |V(Gf g ")| 0101 , where V(G) is the number of vertices 
of the graph G (the graph Ge 8 gives better dependence on the number of vertices than Ge 7 does). 
Our results show that the family of graphs G„ gives a slightly higher ratio of roughly | V(G„) | 0187 . 

Theorem [I] follows directly from the following lemmas, which give lower and upper bounds 
on the entangled capacity and Shannon capacity, respectively. 
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Lemma 2 Let nbe a positive integer such that there exists a Hadamard matrix of size (n + 1). Then, for 
G either G n or H n , we have q (G) > \V(G)\/(n + l) 2 . 

Lemma 3 Let p be an odd prime and let n = 4p — 1. Then, for G either G n or H n , we have 

e (G )<® + ( ; ) + ... + '» 

Aside from relying on a few basic facts of graph theory and the theory of finite fields, our 
proofs of these lemmas are straightforward and self-contained. To obtain the asymptotic bound of 
Theorem [l] we upper bound the binomial sum of Lemma [3]by the well-known estimate 2 nH (P' /n \ 
where H(f) = — Hog 2 1 — (1 — t) log 2 (l — t) is the binary entropy function, and use the bound 
\V(G n )\ > Q(2'Vv / ")- 



3 The entangled capacity of a graph 

In this section we give the formal definition of the entangled capacity of a graph. Let G be a finite 
simple undirected graph, with vertex set V(G) end edge set E(G) C {^). The strong graph product 
G Kl H of two graphs G and H has as vertex set all pairs (u, v) £ ^(G) x V(H). Two vertices 
(u,v) and (u',v') are adjacent in G Kl H if and only if u and u' are adjacent in G or equal, and 
v and v' are adjacent in H or equal. For example, if {u,u'} S £(G) and {v, v'} £ E(H), then 
the four pairs (u, v), (u,v'), (u',v) and (u',v') form a 4-clique in G Kl H^] We denote by G^ n the 
n-fold strong graph product of G with itself. Recall that the Shannon capacity of G is defined as 
0(G)=sup, I ( a (G^)) 1/ ". 

The entangled capacity of a graph is defined as follows. 

Definition 1 (Entangled capacity of a graph) Let G = (V,E) be a graph. Define 0L q (G) to be the 
largest natural number M such that there exists a Hilbert space T-L, a trace-1 positive semidefinite operator p 
on TL and a positive semidefinite operator p(u)j on T-L for every i 6 {1, . . . , M} and u S V satisfying 

1- LuGVp( u )i = p for every i G {1,...,M} 

2. p(u)ip(u)j = Ofor every u G V and i ^ j 

3. p(u)ip(v)j = if {u,v} £ E and i ^ ;'. 
The entangled capacity of G is defined by 

© q (G)=sup(oc q (G m )) 1/n . 

n 

2 Hence the symbol E3. 
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The parameter a 9 (G) satisfies etq{G) > oc(G) and is a generalization of the independence num- 
ber. To see this, restrict in Definition [T] the space H to be one dimensional and add the further 
restrictions p = 1 and p(u)i G {0,1}. Say that a vertex u G V gets label i if p(u)i = 1. Con- 
dition 1 says that exactly one vertex gets label i, Condition 2 says that each vertex gets at most 
one label and Condition 3 says that no two adjacent vertices belong to the privileged subset of 
labeled vertices. Hence, the system of numbers p(u)j gives an independent set of size M, namely 
the set {u : p(u)i = 1 for some z}. Since ix.q(G) relaxes this characterization of a (G), it follows that 
oc q {G)>oc{G). 

By using tensor products of the operators p and p(u)j it is not hard to see that cc q is super- 
multiplicative under strong graph powers. It follows that ©o(G) > OLq (G) and (by Fekete's lemma) 
that©,(G) = lim^eo («,(G Bte )) 1/B . 

4 Entanglement-assisted communication 

In this section we describe the model of zero-error entanglement-assisted communication over 
classical channels. Readers who are familiar with this model or want to move on to the proof 
of the main result can safely skip this section. We start with some basic definitions of quantum 
information theory. For more details we refer to Nielsen and Chuang [NC00J. 

Shared entangled states. A state is a positive-semi definite matrix whose trace equals 1. We iden- 
tify a matrix of size d x d with a linear operator on C d in the obvious way. A state should be though 
of as describing the configuration of a quantum system: an abstract physical object, or collection of 
objects, on which one can perform experiments. Associated with a quantum system Q is a com- 
plex Euclidean vector space Q = C d , for some dimension d. The possible configurations of Q are 
the states on Q. 

Suppose the sender and receiver hold quantum systems S and R, respectively. Associated 
with the sender's system is a space X = C n , and associated with the receiver's system is a space 
y = C m . Then, by definition, the possible configurations of the joint system (S, R) are the states 
on X CS> y. If the system (S, R) is in the state p, then the sender and receiver are said to share the 
state p. A state on X <g> y is entangled if it is not a convex combination of states of the form p$ <8> Pr, 
where p$ is a state on X and p% a state on 3^- 

Measurements. Let S be a finite set and let Q be a quantum system with associated vector space 
Q = C d . A measurement on the system Q with outcomes in S is a system of positive semidefinite 
matrices M s on Q, s G S, that satisfies 

E MS = l Q' 

seS 

where Iq denotes the identity on Q. 

Let {A s G C nxn : s G S} be a measurement on the sender's quantum system S. The numbers 

p s = Tr((A s ®I y )p) 
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define a probability distribution on S. This follows easily from the properties of the matrices A s 
and p and the fact that for positive semidefinite matrices A and B, we have Tr(AB) > 0. 
The partial trace function over X of a matrix M on A" <g> y is defined by 

Tr*(M) = (ex <g> 73;) T M(>i <g> Iy) H h (e„ ® J;);) T M(e n <g> Jy), 

where e\, . . . , e n are the canonical basis vectors for X . This function yields an mxm matrix (i.e., a 
linear operator on the space 3^)- It is not hard to see that each of the matrices 

_ 

Ps 

are in fact states on the space y associated to the receiver's system R. 

The postulates of quantum mechanics dictate that if the sender -performs the measurement defined 
by the matrices A s on her system S, then the following two things happen: 

1. she obtains outcome s 6 S with probability p s , 



2. the receiver's system R is left in the state p(s) on y. 

For some finite set TZ, the receiver can perform a measurement {B* G C mxm : t G on R, 
and he will obtain outcome t with probability Tr (B'jO(s)) . The joint probability of the sender and 
receiver obtaining outcomes s and t, respectively is then given by Tr ((A s <g> B*)p). If the state p 
is not entangled, then this probability distribution can be described by a classical local-hidden- 
variable model. Entanglement is thus necessary to obtain non-classical, quantum distributions. 

Entanglement-assisted communication. To send messages across a noisy channel defined by 
input alphabet S, output alphabet TZ and conditional probability distribution P, the sender and 
receiver can use shared entanglement as follows. Let p be a state shared between the sender and 
receiver. Let M be a positive integer and for every i G {1, . . . ,M} let {A- G C" xn : s G S} be a 
measurement on the sender's system S with outcomes in S. For every t G 1Z, let {B ! t G C mxm : t G 
1Z} be a measurement on the receiver's system R with outcomes in {1, . . . ,M}. Suppose that for 
every i ^ j and (s, t ) G S x 1Z such that P(t |s) ^ 0, we have 

Tr ((A?®Bi)p) =0. 

To communicate the index i, the sender can then perform the zth measurement on her system 
and sends her outcome s G S through the channel. The receiver gets a message t G 1Z satisfying 
P{t\s) 7^ 0. The above discussion shows that if the receiver then performs the measurement 
labeled by t, he obtains outcome i with probability 1. 

The link between this model and Definition [T] is given by the following theorem. Let us denote 
by &'q(S, TZ, P) the maximum number M such that a state p and matrices A s jr B\ with the above 
property exists. 

Theorem 4 (Cubitt, Leung, Matthews and Winter [CLMW10 D Let (S, TZ, P) be a noisy channel and 
let G be its confusability graph. Then, cc'^ (S, TZ, P) = cc q (G). 
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5 Preliminaries 



Notation. We use the following notation. 

• For strings x,y E {0, l} n , let d(x,y) denote their Hamming distance. 

• For vectors u, v G R", let u ■ v denote their Euclidean inner product. 

• For a prime number p, we write F p for a finite field consisting of p elements. 

• For vectors u,v G F p , let (u,v) denote their inner product over F p . 

• For a field F, we denote by F[vi, ...,v n \ the ring of n-variate polynomials with coefficients 
inF. 

Some basic graph theory. Let G be a graph. A permutation of the vertices n : V(G) — > V(G) 
is an automorphism of G if for every G V(G), the pair {7t(m), tc(v)} is an edge if and only if 
{u,v} is an edge. Let Aut(G) denote the group of automorphisms of G. For u G V(G), the set 
Orb(u) = {7r(u) : n G Aut(G)} is the orbit of « and the set Stab(u) = {71 G Aut(G) : 7t(m) = u} 
is the stabilizer of u. 

Definition 2 A grapfo G is vertex transitive z//or every vertex u G V(G), we have Orb(w) = V(G). 

Lemma 5 (The Orbit-Stabilizer Theorem |AB95|) Let G be a graph and u G V(G). Tfoen 

I Orb(u) I • I Stab(w) | = | Aut(G) |. 

Corollary 6 Let Gbea vertex transitive graph and u,v G V(G). T/zen, ffoere are exactly | Aut(G) | / 1 V(G) | 
automorphism ofG that map u to v. 

PROOF: Since G is vertex transitive, there exists an automorphism g G Aut(G) such that g{u) = v. 
Consider the set of automorphisms g ■ Stab(u) = {gh : h G Stab(u)}- Clearly g'(u) = v for every 
g' G g ■ Stab(u). We claim that g ■ Stab(u) contains all automorphisms that map u to v. To see this, 
notice that for any g" G Aut(G) such that g"(u) = v, we have g~ l g" G Stab(w) and hence g" = 
g{g~ r g") G g ■ Stab(u). Since gh = gh' implies that h = h' , we have \g ■ Stab(u)| = | Stab(w)|. The 
claim follows because, by the Orbit-Stabilizer Theorem, we have | Stab(u) | = | Aut(G) | / 1 Orb(w) |, 
and by vertex transitivity of G, we have | Orb(w) | = | V{G) \ . □ 

6 Lower bounds on the entangled capacity 

In this section we lower bound the entangled capacity of the graphs G„ and H n . We start by 
dealing with the graph G n . The graph H n will afterwards be treated in a similar manner. 

To prove the lower bounds we use a straightforward general method which was also used 
before in I CLMW10 , |LMM + 12| . Recall that a (real) d-dimensional orthonormal representation of 
a graph G is a mapping / : V(G) — > R d satisfying f(u) ■ f(u) = 1 and f(u) ■ f(v) = for every 
{u,v} G E(G). 
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Proposition 7 If a graph G has an orthonormal representation f : V(G) — > R d and and has M disjoint 
d-cliques, then & q (G) > M. 

PROOF: Let {1,. . . ,M} be a label set for the disjoint cliques. Let p = I/d where I is the d-by- 
d identity matrix. For every u G V and i G {1,. . . ,M} let p(u)j = f(u)f(u) T /d if u belongs 
to the i th clique and let p(u)j be the zero matrix otherwise. Clearly these matrices are positive 
semidefinite and it is easy to check that they satisfy the conditions of Definition [I] using the fact 
that for every d-clique C Q V, the set {f(u) } u eC is a complete orthonormal basis for R rf . This gives 
© 9 (G) > a q {G) > M. □ 

The lower bounds on the entangled capacity given in Lemma [2] follow immediately from the 
following two lemmas and Proposition [7| 

Lemma 8 Let n be an odd integer. Then, the graph G n has an n-dimensional orthonormal representation. 

Lemma 9 Let n be such that there exists a Hadamard matrix of size n + 1. Then, the graph G n has at least 
| V(G„) | /n 2 disjoint cliques of size n. 

We proceed by proving these lemmas. 

PROOF OF Lemma[8J Associate with every vertex x = {x\, . . . ,x n ) G V a sign vector given by 
u[x] = ((— l) Xl , . . . , (— 1) X ") T 6K". Let 1 denote the ft-dimensional all-ones vector. Note that for 
every x G V, we have u[x] -1 = —1, as the Hamming weight of x is (ft + 1) / 2. Moreover, for every 
{x,y} G E we have u[x] ■ u[y] = —1, which follows from the fact that d(x,y) = (n + l)/2. 

Now consider the (n + 1) -dimensional unit vectors f(x) = (u[x] © 1)/ y/n + 1 (i.e., the column 
vector u[x] with a 1 appended to it, normalized). These vectors satisfy: 

1. for every {x,y} G E we have 

m-m = iu[x] ®y + { " M(B1) =q, 

2. for every x G V we have 

/W . (1 ei ) = Mfl|l).( 1 e 1 ) = zl±i = o. 

\/n + l Vn + 1 

The first item shows that / forms an orthonormal representation of G. The second item says 
that the vectors {f{ x )) xGV He on a single n-dimensional hyperplane (orthogonal to the all-ones 
vector). Hence these vectors span a space of dimension at most ft. It follows that there is an 
n-dimensional orthonormal representation of G„. □ 

To prove Lemma [9] we need to find a large number of disjoint ft-cliques in G n . We achieve this 
by first finding just one ft-clique. Using the fact that G„ is vertex transitive, we show that existence 
of a single clique implies the existence of many disjoint cliques. More explicitly, one can produce 
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many pairwise disjoint n-cliques by simultaneously permuting the coordinates of the strings in 
this one clique. Notice that this permutation operation leaves both the Hamming weights and 
the Hamming distances invariant. A suitable choice of such permutations give pairwise disjoint 
cliques from any single clique, as whether or not a set of n-bit strings forms a clique in G n depends 
only on their Hamming weights and Hamming distances. 

The following proposition tells us when we can find a single n-clique in G n . 

Proposition 10 Let n be such that there exists a Hadamard matrix of size (n + 1). Then, there exists an 
n-clique in G n . 

PROOF: Let M be an (n + 1) x (n + 1) Hadamard matrix. We may assume that the first row and 
column of M contain only +ls, since multiplying all entries in a row (or column) by — 1, gives 
again a Hadamard matrix. Since each of the last n rows of M is orthogonal to the first row, it 
has exactly (n + l)/2 entries equal to —1. Moreover, since each pair from the last n rows of M is 
orthogonal, the two rows differ in exactly (n + l)/2 coordinates. 

Let C be the the nxn matrix obtained by removing the first row and column from M. Then, 
each row of C has exactly (n + l)/2 entries equal to —1, and every pair of rows from C differ in 
exactly (n + 1) / 2 coordinates. Hence, the rows of C are a clique in G n . □ 

Next, we lower bound the number of disjoint n-cliques of size n in G n . We use the following 
lemma and proposition. 

Lemma 11 Let G be a vertex transitive graph that has a d-clique as an induced subgraph. Then, G has at 
least \ V{G~)\I d 1 vertex-disjoint induced d-cliques. 

PROOF: Let W = C\ U C2 U • • • U Cfc be a union of k disjoint d-cliques, with k maximal. Since 
G is vertex transitive, Corollary [6] implies that for every pair of vertices u, v there are exactly 
I Aut(G) I / 1 V(G) I automorphisms mapping u to v. It follows that at most 

\W\ • |Ci| • I Aut(G)|/|V(G)| 

automorphisms map a vertex in C\ to a vertex in W. 

On the other hand, by maximality of k, o~{C\) fl W is nonempty for every automorphism a. It 
follows that \W\ ■ |d I > and hence A: = |W|/|Ci| > |V|/|Ci| 2 = \V\/d 2 . □ 

Proposition 12 For every n, the graph G„ is vertex transitive. 

PROOF: Consider the group S n of permutations on {1, ... , n}. For every a € S„ define the map 
: {0, 1}" —> {0, 1}" by T a (x) = (x^i), . . . ,x a ^). As T a leaves the Hamming weight invariant, 
we have T a : V(G„) -> V(G n ). Moreover, A (IV [x),T ff (y) ) = A{x,y). Hence, T a e Aut(G„). 
Finally, for every x G V(G„) we have {T a (x) : a G S n } = V(G„), and we are done. □ 



PROOF OF LEMMA [9j The result follows by combining Propositions 10 and 12 and Lemma 11 □ 
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We deal with the graphs H n in the same way as we did with the graphs G n . We directly obtain 
the result of Lemma|2]for these graphs by combining the following two lemmas with Proposition]?] 

Lemma 13 Let n be an odd integer. Then, H n has an orthonormal representation of dimension n + 1. 

Lemma 14 Let n be such that there exists a Hadamard graph of size n. Then, the graph H n has at least 
\V(H n )\/(n + l) 2 disjoint cliques of size n + 1. 

PROOF OF lemma[T3J Associate with every vertex x = (x\, . . . , x n ) 6 V the vector 

«[*] = ((-!)* (-1)*) T . 



Then, the unit vectors f(x) = (u[x] © 1) / ' V n + 1 form an (n + 1) -dimensional orthonormal repre- 
sentation of H n . □ 

To prove Lemma 14 we proceed as in the proof of Lemma|9] We first find a single (n + 1 )-clique 



in H n . Then we prove that H n is vertex transitive and use Lemma 11 



Proposition 15 Let n be such that there exists a Hadamard matrix of size (n + 1). Then, there exists an 
(n + l)-clique in H n . 

PROOF: Let C be an n-clique in the graph G n . Then, since each of the vertices in C has Hamming 
weight (n + l)/2, the union of C and the all-zeroes string gives an (n + l)-clique in H n . The result 



now follows from Proposition 10 □ 



Proposition 16 Tor every n, the graph H n is vertex transitive. 

PROOF: Recall that V(H n ) C FiJ consists of the strings of even Hamming weight. For every 
z G V(H n ) define the linear bijection E 2 : F£ — > ^ by E 2 (x) = x + z. As E z leaves the parity of 
Hamming weight invariant, we have E 2 : (H n ) — > y(H„). Moreover, A (E 2 (x),E z (i/)) = A(x,y). 
Hence, E 2 G Aut(G n ). For every x G V(H„) we have {E 2 (x) : a G z G V(H n )} = V(H n ), and we 
are done. □ 



PROOF OF LEMMA [14J The result follows by combining Propositions 15 and 16 and Lemma 11 □ 



7 Upper bounds on the Shannon capacity 

In this section we upper bound the Shannon capacity of the graphs G n and H n . We recall that G„ 
has as vertex set all binary strings of odd length n and Hamming weight (n + l)/2, and as edge 
set the pairs of vertices with Hamming distance (n + l)/2. The graph H n has as vertex set all 
binary strings of odd length n and even Hamming weight, and as edge set the pairs of vertices 
with Hamming distance (n + 1) II. The proof of the upper bounds in Lemmas [3] is based on a 
general method of Haemers [Hae78| and an algebraic lemma of Frankl and Wilson [FW81 1. 
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Lemma 17 (Haemers IIHae78l ) Let G = (V, E) be a graph. Let F be afield. Let A : V x V -¥ F be a 

matrix such that for every x G V we have A(x,x) ^ and for every non-adjacent pair x,y G V we have 
A(x r y) = 0. Then, 0(G) < rank(A). 

PROOF: Say that a matrix A : V x V — > F fits G if it satisfies the conditions stated in the lemma. Let 
S C V be a maximum-sized independent set and let A be a matrix that fits G. Then, the principle 
submatrix of A defined by S has rank |S|. Hence, we have a(G) < rank(A). The result follows 
because A m fits G m and rank(A®") = rank(A)". □ 

We say that a polynomial is multilinear if its degree in each variable is at most 1. 

Lemma 18 (Frankl-Wilson |FW81|) Let p be an odd prime, let rbea natural number and letn = rp — 1. 
Let V C {—1,1}" C Fp be a set of vectors over F p . Then, for every u G V £/zere exists a multilinear 
polynomial P u € F p [z7i, satisfying: 

1. P u (u) £0, 

2. for every v G V suc/z tto (w, y) ^ — 1, we have P u (v) = 0, 

3. deg(P ;i ) < p - 1. 

PROOF: For every vector u G V let Q„ G F p . . . , v n ] be the polynomial defined by 

p-i 

Qu(v) = Yl{(u,v)+l-i), 

i=l 

Since n = —1 (mod p), every v G V satisfies = —1. By Wilson's Theorem (see for 

example Lidl and Niederreiter IILN83IH it follows that Q u (u) = (-l)P _1 (p - 1)! = (-l) p . If 
(u, v) 7^ — 1 we have Q u (v) = since in this case we have (u, v) + 1 G {1, . . . , p — 1 }. In particular, 
we have Q m (m) 7^ 0. Since v i-> (w, o) is a linear function, we have deg(Q„) = p — 1. 

Define the multilinear polynomial P„ by expanding Q„ in the monomial basis and changing 
the powers fj the monomial v} ■ ■ ■ vll' to if t{ is even and to 1 if is odd. Then P„ is multilinear 
and agrees with Q u everywhere on { — 1, 1}" and satisfies deg(P t; ) < deg(Q„). □ 

We now show how these two lemmas can be combined to give Lemma |3j which states that 
for p an odd prime and n = 4p — 1, and G either G n or H n , we have ©(G) < O (2 - 812 ") . 

PROOF OF Lemma [3j Let (V,E) be either G„ or H n . For every x G Vletu[x] = ((-l) Xl , . . . , (-1) J ») 
be the corresponding sign vector in F". Since n = 4p — 1 = — 1 (mod p) and F p is isomorphic to 
the ring of integers mod p, we have for every x,y G V, 

{u[x],u[y]) =n- 2d(x,y) = -2d(x,y) - 1. (1) 
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Letx,y G V be distinct vertices such that {x,y} E is a non-edge. We claim that (u[x],M[y]) / 
— 1. It is not hard to see that if two strings have even Hamming weight, then their Hamming 
distance is also even. Hence, we have d(x,y) G {2,4, . . . ,4p — 2}. Moreover, since p is odd, the 
only possible multiple of p that 2d{x,y) can attain is 4p. Since edges are formed by pairs with 
Hamming distance 2p, we have 2d(x,y) ^ 4p. This implies that 2d(x,y) ^ (mod p), and the 
claim follows form Eq. Q. 



Set r = 4 and let V = {i/[x] : x G V}. Then, Lemma 18 gives a multilinear polynomial 
G F p [z?i ...,v n ] for every x G V, satisfying: 

1. ?*(«[*]) ^0, 



2. for every y G V such that y ^ x and {x,y} ^ £, we have P x (u[y]) = 0, 



3. deg(P*) < p - 1. 

The set of multilinear monomials in n variables of degree at most p — 1 forms a basis for the 
space of multilinear polynomials of degree at most p — 1. For every vertex jc G V, define vectors 
S [x] , T [x] G Fp^ 1 as follows. For monomial m G M let S [x] m be the coefficient of m in the expansion 
of the polynomial P x in the basis M. and let T[x] m = m(w[x]) be the value obtained by evaluation 
the monomial m at u[x]. Then, for every x,y EV we have S[x] • T[y] = P x {u[y]). 

Consider now the matrix A : V x V — > F p defined by A(x,y) = S[x] • T[y]. Since the vectors 
S[x] and T[y] have dimension |A4|, we have rank(A) < Additionally, it follows from the 

properties of the polynomials P x that the matrix A satisfies A(x,x) ^ for every x G V and 
A{x, y) = for every non-adjacent pair x, y G V. 

The claim now follows from Lemma fTz] and the fact that 



/'-I 

E 

i=0 

This completes the proof. □ 
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